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Abstract 



We give an expository review of applications of computational algebraic statis- 
tics to design and analysis of fractional factorial experiments based on our recent 
' works. For the purpose of design, the techniques of Grobner bases and indicator 

c/3 . functions allow us to treat fractional factorial designs without distinction between 

regular designs and non-regular designs. For the purpose of analysis of data from 
fractional factorial designs, the techniques of Markov bases allow us to handle dis- 
crete observations. Thus the approach of computational algebraic statistics greatly 
] enlarges the scope of fractional factorial designs. 
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g : 1 Introduction 
o 

Application of Grobner bases theory to designed experiments is an attractive topic in a 
relatively new field in statistics, called computational algebraic statistics. After the first 
^ I work by Pistone and Wynn (|2l]) this topic is vigorously studied both by algebraists and 
?H I statisticians. These developments are stimulated by advancements in algebraic algorithms. 

By recent algorithms some practical computations are becoming feasible for statistical 
applications. For these backgrounds see |5]. 

In this paper we revisit some fundamental results in this field, mainly in the treatments 
of two-level fractional factorial designs. The most important part of the long history 
of studies of two-level fractional factorial designs is the theory of regular designs. As 
explained in many classical works (e.g. [8], [9]), properly chosen regular fractional factorial 
designs have many desirable properties, mainly due to orthogonality and balancedness of 
designs. In addition, an elegant theory based on the linear algebra over GF{2) is well 
established for regular two- level fractional factorial designs (e.g. |2Uj). 
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On the other hand, there are still many open problems on general structures of non- 
regular fractional factorial designs, except for some specific designs such as Plackett- 
Burman designs. In computational algebraic statistics, designs are simply characterized 
as the solutions of a set of polynomial equations. Therefore we need not distinguish 
between regular designs and non-regular designs. In fact, this algebraic treatment yields 
many new results. For example, concepts defined for regular designs, such as resolution 
and aberration, can be naturally generalized to non-regular designs in the framework of 
computational algebraic statistics. 

The construction of this paper is as follows. In Section |2] we review some algebraic 
definitions for handling fractional factorial designs. In Section [3] we present a method of 
representing the confounding relations between factor effects algebraically, which is one 
of the results in [23]. In Section H] an indicator function defined in [15] is given. The 
indicator function is a valuable tool to characterize non-regular designs. In Section [5] 
we show a simple result related to the indicator function when new factors are added or 
when interaction effects are formally considered as factors. In Section E] we consider the 
problem of selecting optimal designs. In Section [7] we discuss Markov chain Monte Carlo 
approach for testing factor effects, when observations are discrete random variables. We 
end the paper with some discussions in Section [HI 

Throughout this paper, we use terminology of Grobner bases theory without definition. 
In particular, we omit explanations for term orders and division algorithms. For these 
notions, see yil] or [T], for example. 

2 Design ideals 

Consider fractional factorial designs of m controllable factors. We assume that the levels 
of each factor are coded as elements of a field which is a finite extension of the field Q 
of rational numbers. For the case of factors with two levels we can take = Q. However 
for the case of factors with more than two levels, it is advantageous to code the levels by 
complex numbers ([22], [23]). In this case K contains some complex roots of unity. 

A fractional factorial design (without replication) is identified with a finite subset of 
K"^. In computational algebraic statistics, this set is considered as the set of solutions of 
polynomial equations, called an algebraic variety, and the set of polynomials vanishing on 
all the solutions is called an ideal. For the rest of this section we only consider the case 
of two- level factors. For more general case see [21] . 

The full factorial design of m factors with two levels is expressed as 

V = {(Xi, ...,Xm)\x\ = --- = xl = l} = {-1, +1}"^, 

where we write —1 and 1 as the two levels. We call a subset T (I'D a. fractional factorial 
design. Let K[xi, . . . , Xm] be the polynomial ring of indeterminates Xi, . . . ,Xm with the 
coefficients in K. Then the set of polynomials vanishing on the points of J-" 

HJ^) = {f ^ -^[a^b ■■■,Xm] I f{xi, Xm) = for all (xi, . . . , Xm) G J"} 
is the design ideal of J-". 
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An ideal / C K[xi, . . . , Xm] is generated by a (finite) basis {gi, . . . , gk} C I if for any 
/ G / there exist polynomials si, . . . , G K[xi, . . . , Xm] such that 

k 

... 5 Xm) ^ ^ "51(2^1) . . . ; Xiji)gi(^Xi, ... 5 2^m). 

The above Si, . . . , are not unique in general. We write / = {gi, . . . , g^) if / is generated 
by a basis {gi, . . . , ^f^}. For example, for the full factorial design of two factors with two 
levels (2^-design), the design ideal of X' = { — 1, +1}^ is written as 

IiV) = {xl-l,xl-l). 

Every ideal has a finite basis by the Hilbert basis theorem. In addition, if {gi, . . . ,gk} 
is a basis of I {J-"), then J-" coincides with the solutions of the polynomial equations gi = 
0,...,gk = 0. 

Suppose there are n runs (i.e. points) in a fractional factorial design T d T). A 
general method to derive a basis of I{J^) is to make use of the algorithm for calculating 
the intersection of the ideals. By definition, the design ideal of the design consisting of a 
single point, (ai, . . . , a^) G {—1, +1}™', is written as 

{X\ 0-1, ... , Xjyi Oim) C K\Xi^ • • • 1 Xm\- 

Therefore the design ideal of the n-runs design, = {(oji, . . . , aj^), i = l,...,n}, is 
given as 

n 

I{J^) = f]{xi - an, . . . ,Xm- aim)- (1) 

i=l 

To calculate the intersection of ideals, we can use the theory of Grobner bases. In fact, by 
introducing the indeterminates ti, . . . , t„ and the polynomial ring K[xi, . . . , Xm, h, ■ ■ ■ , tn], 
equation ([T]) is written as 

i{j^) = rnK[xi,...,Xm], 

where 

/* = {ti{xi - an), ti{xm - dim), i = 1, . . . , n, ti H h t„ - 1) (2) 

is an ideal of K[xi, . . . ,Xmitii ■ ■ ■ ■,tn\- Therefore we can obtain a basis of I{J^) as 
the reduced Grobner basis of /* with respect to a term order satisfying {ti, . . . >- 
{xi, . . . This argument is known as the elimination theory, one of the important 

applications of Grobner bases ([Hj). 

Example 2.1 (A 2jj^^ design). Consider the design known as the orthogonal array L^{2'^) 
of resolution III with the defining relations 

X3 = —X1X2, = —XiXi, Xq = —X2X4, Xj = X1X2X4 (3) 

given as follows. 
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For this design a basis of I{J^) is obtained by omitting elements containing the indeter- 
minates ti, . . . , tg from the reduced Grobner basis of For the lexicographic term order 
with Xi>~ ■ ■ ■ >~ x-j, the reduced Grobner basis is given as 

{x-j — 1, Xq — 1, X5 — 1, X3 + X5X6, X2 + X5X7, Xi + XqXt, Xa — X5XQX7}, (4) 

while for the graded reverse lexicographic term order, the reduced Grobner basis is given 
as 

1 ^2 1 ^2 T ^2 1 ^2 1 ^2 T ^2 1 

X2X3 + Xi, X4X5 + Xi,XqX7 + Xi,XiXs + X2, X4XQ + X2, X5X7 + X2-, 

X1X2 + X3, X4X7 + Xs, X^Xq + Xs, X1X5 + X4, X2XQ + X4, + X4, (5) 

X1X4 + X5, + X5, X3X6 + X5, X1X7 + xe, X2X4 + xe, X3X5 + Xq, 

XiXq + X7, X2X5 + X7, X3X4 + X7}. 

Hereafter, we write a monomial of the indeterminates 
It is sufficient to consider a = (ai, . . . ,am) € {0, 1}™ since the two levels are coded as 
{— 1, +1}. The results in Example 12. II indicate the relation between the defining relation 
and the design ideal for regular fractional factorial designs, i.e., designs obtained from the 
defining relations such as (j3]). In fact, for the design J-" obtained by the defining relation 

x^^ = Q, Q e {-1,1}, i=l,...,S, 

the design ideal is written as 

= (x? - 1, . . . , - 1, x"i - Ci, . . . , x"= - Cs). 

For example, the design ideal in Example 12.11 is also written as 

/(J-") = (x^ — 1, . . . , X7 — 1, X1X2X3 + 1, X1X4X5 + 1, X2X4X6 + 1, X1X2X4X7 — 1). (6) 

As we see here, an obvious basis of the design ideal of a regular fractional factorial design 
G V consists of defining relations in addition to 1. Also for non- regular 

designs we can consider the set of polynomials (in addition to xf — 1, . . . , x^ — 1) which 
forms a basis of I{J^)- This set of the polynomials is called a set of defining equations of 
J-" in [15]. This is a generalized concept of defining relations from regular to non- regular 
designs. Note that the above obvious basis of a regular design is not a Grobner basis in 
general. In fact, the right hand side of ([6]) is not a Grobner basis for any term order. 
In the arguments above we used the elimination theory as a general method to obtain a 
basis of the design ideal and obtained a reduced Grobner basis as a result. However, it is 
important in itself to obtain a Grobner basis, which we see in the next section. 
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3 The confounding relation and the ideal member- 
ship problem 

In this section we see from the Grobner bases theory that the confounding relation can 
be generahzed from regular to non-regular designs and expressed concisely. This is one 
of the merits to consider the design ideal I{J^). In fact, the problem of judging whether 
two factor effects or interaction effects are confounded or not is equivalent to the ideal 
membership problem, which is solved by calculation of a Grobner basis of the design ideal. 
We give an overview of this fact. For details see |2l] or [16]. 

First we give some necessary notation and definitions. LTt-(/) denotes the leading 
term of the polynomial / G K[xi, . . . ,Xm] with respect to the term order r. For an 
ideal / C K[xi, . . . ,Xm\, we write the set of the leading terms of the elements in I as 
LTt-(/) = {LT^(/) I / G /}. A monomial is called a standard monomial if it does not 
belong to LTt-(/). From the definition of Grobner basis, the set of standard monomials is 
also characterized as the set of monomials which is not divisible by any leading term of 
the element of the Grobner basis with respect to the term order r. We write the set of 
standard monomials {x^ | x** ^ LTt-(/(J-'))} as Estr{J^)- The following is a basic theorem 
(Proposition 1.1 of [25j) in the theory of Grobner bases. 

Theorem 3.1. K[xi, . . . ,Xm]/I{J^) is isomorphic as a K-vector space to Span(EstT-(J-')). 
EstT-(J^) is a basis of this vector space. 

Estr(v^) represents one of the identifiable sets of main and interaction effects under 
the design J-" and the number of the monomials in Estr(J^) is always the same as the run 
size n for any r. 

Example 3.1. For the two reduced Grobner bases in Example \2.1l EstT-(J-') is written as 
follows. 

• lexicographic: {1, X5, Xg, 0:7, x^Xq, X5X7, XqXj, x^XqXj} 

• graded reverse lexicographic: {1, xi, X2, X3, X4, X5, xq, X7} 

Now we consider the relation between the confounding relation and the design ideal. 
We identify a monomial x*^ with a main effect if Y^iLi "^i = ^ind a two-factor interaction 
effect if Yl^i '^j = 2 and so on. Then two main or interaction effects are confounded in 
the design J-" if x'^^x**^ is identically equal to +1 (or —1) for all the points in x G J-". 
The confounded effects cannot be estimated simultaneously. Therefore a design has to be 
chosen such that main effects are confounded with higher order interaction effects under 
the hierarchical assumption. This is the concept of resolution. The confounding relation 
is expressed in terms of the design ideal as follows. 

Proposition 3.1. Let c G { — 1,+1}. Then the following two conditions are equivalent. 

(i) x^^x^^ = c for all x G (ii) x^^ - cx^^ G 

In general, we have to calculate a Grobner basis to judge whether a given polynomial 
belongs to a given ideal or not, i.e., to solve the ideal membership problem. 
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Example 3.2. Consider the design in Example \2.1\ again. Since the defining relation 
X3 = —X1X2 exists, the main effect of X3 and two-factor interaction effect of xi and X2 
are confounded. For the reduced Grobner basis X1X2 + G I{J^) is obvious since the 
basis includes X1X2 + ^3 . On the other hand for the reduced Grobner basis it is shown 
as follows 

X1X2 + X3 = {Xi + XqX-j)x2 - {X2 + X5X7)XfiX7 + {Xj - l)x5Xfi + {x^ + X^Xq). 

For the case that the factor has s (s > 2) levels, similar relation as in Proposition 13.11 
holds if we code the levels as the sth root of unity. For example of three-level factors, the 
levels are coded as {1, cj, cj^}, u = exp(27rz/3). See [23] and jl] for details. 

4 Indicator functions 

In this section, we introduce an indicator function, which is defined by [15]. The indicator 
function of a design C P is a polynomial / G K[xi, . . . , Xm] satisfying 

, . _ J 1, if X G J" 
^^""^ ~ \ 0, if xG I?\ 

The indicator function has a unique square-free representation under the constraint = 
l,i = 1, . . . ,m, and in a one-to-one correspondence to the design J-". Many important 
results in the field of computational algebraic statistics are related to the indicator func- 
tion. For example, some classes of fractional factorial designs can be classified by the 
coefficients of their indicator functions. It is also shown that some concepts of designed 
experiments such as confounding, resolution, orthogonality and estimability, are related 
to the structure of the indicator function of a design. Since the indicator function is 
defined for any design, some classical notions for regular designs, such as confounding 
and resolution, can be generalized to non-regular designs naturally by the notion of the 
indicator function. See [U] or [27] for details. 

In addition, since the indicator function is a polynomial, it can be incorporated into 
the theory of computational algebraic statistics naturally. For example, the design ideal 
/(J-") is simply written as 

/(^) = (x?-l,...,x^-l,/(x)-l), 

where /(x) is the indicator function of J-". In other words, the indicator function forms a 
set of defining equations by itself. 

We list some characteristics of the indicator function. The indicator function of the 
full factorial design V is /(x) = 1. The constant term of the indicator function of a 
fractional factorial designs is equal to the fraction n/2'". The indicator function of a 
regular fractional factorial designs is simply written as a product of its defining relations 
(see [H] or [2Z])- For example, the indicator function of the 2''"^ design in Example 12.11 
is written as 

/(x) = -^(1 - xiX2a;3)(l - a;ia;4a;5)(l - X2X4X6){1 + XxX2X^X'j). (7) 
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The absolute values of the coefficients in the indicator function do not exceed the constant 
term. In particular, the absolute values of all the coefficients in the indicator function of 
regular designs coincide with the constant term. 

Concerning non-regular designs, one of the results on the coefficients of the indicator 
function of a non-regular design is related to the existence of a regular design including 
the non-regular design. 

Example 4.1 (The indicator function of the non- regular fractional factorial designs). 

Consider the following three fractional factorial designs. 
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J^i is a 2^^^ regular design defined by X1X2X3 = 1, and T2 is a non-regular design which is 
a proper subset of T\. J-3 is also a non-regular design, but there does not exist a regular 
design which includes J-3 as a proper subset. The indicator functions of these three designs 
are given as follows. 

J"! : /(x) = i + ixiXaXg 

J^2 ■ /(x) = I + +X2+X3- X1X2 + X1X3 + X2X3) + ^XiX2X3 

00 o 

J's : /(x) = - + -(xi + X2 + X3- X1X2X3) 

An important observation is that the terms whose coefficients are equal to to the constant 
term in the indicator function of ^2 (i.e., X1X2X3) coincide with the terms of the indicator 
function of T\, and there are no such terms in the indicator function of J-3. These 
characteristics of indicator functions hold in general. 

The absolute value of a coefficient in the indicator function represents a complete 
confounding relation if it is equal to the constant term, and a partial confounding relation 
if it is smaller than the constant term. Considering this point, we gave in [3] a new class 
of two-level non-regular fractional factorial designs, called an affinely full-dimensional 
factorial design, which has a desirable property for the identifiability of parameters. We 
present it briefly in Section [6l 



5 Indicator function for adding factors 

In this section, as a simple application of the indicator function, we consider the design 
ideal for adding factors. The additional factors may be real controllable factors, whose 
levels are determined by some defining relations. For the purpose of Markov bases in 
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Section [7] the additional factors are formal and correspond to interaction effects included 
in a null hypothesis. 

Let T\ be a fractional factorial design of the factors xi,...,Xrrt. Consider adding 
factors Hi, . . . ,yk to J^i. We suppose the levels of the additional factors are determined 
by the defining relations among xi, . . . , as 

yi = eix^\ ■■■,yk = efcx''\ 

where ei, . . . , G {—1, 1}. Write this new design of Xi, . . . , x^, yi, . . . ,yk as J-2. The run 
sizes of J^i and J-2 are the same. 

Let /i and /2 be the indicator functions of J^i and J-2, respectively. Then we have 

/2(xi, . . . , Xm, yi,...,yk) = ^{1 + eiyix^^) ■■■(! + ekyk^^'')fi{xi, . . . , x^). (8) 

In fact, for (xi, . . . ,Xm,yi, ■ ■ ■ ,yk) e J^2, {xi,..., Xm) e Tx and 

ei?/ix''i = . . . = Ckyk^^" = 1 

hold, which yields /2 = 1. Conversely, if (xi, . . . , x^, . . . , ?/fc) ^ J'2, then (xi, . . . , x^) ^ 
J^i or some of eiyix^^, . . . ,ekyk^^'' has to be —1, which yields /2 = 0. Note that ([HD 
generalizes the indicator function of regular fractional factorial designs ([7]), by taking 
/i = 1, i.e. by assuming the full factorial design for Xi , . . . , Xm- 
From the above result, we have an expression of I{J^2) 

= {xl-l,...,xl-l,yl-l,...,yl-l,h-lj2-l). 

If we fix the term order r on Xi, . . . , x^ and a on Xi, . . . , x^, yi, ■ ■ ■ , yk, EstT-(J-'i) and 
Esto-(J^2) are defined. Estr{J^i) and Esto-(J^2) contain the same number of monomials 
since the run sizes of J-'i and J-'2 are the same. In particular, if we use a term order a such 
that {yi, . . . ,yk} {xi, . . . , x^}, then Est^(J'i) = Est<^(J'2) holds. 

6 Consideration of selection of optimal designs 

This section is based on two works by the authors related to optimal selection of non- 
regular designs. 

In p] we defined a new class of two-level non-regular fractional factorial designs as 
follows. 

Definition 6.1 (Definition 2.1 of [3]). A non-regular fractional factorial design J-' is called 
an affinely full- dimensional factorial design if there is no regular fractional factorial design 
J-"' satisfying T C J^' . Conversely, a non-regular fractional factorial design J-" is called 
a subset fractional factorial design if there is some regular fractional factorial design T' 
satisfying T ^T' . 

At a glance, the merit of this definition might not be clear. One of the properties of the 
affinely full- dimensional factorial design is the simultaneous identifiability of the param- 
eters. In fact, if T is an affinely full-dimensional factorial design, then the parameters of 
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the main effect model are simultaneously identifiable and vice versa. See Lemma 2.1 and 
Lemma 2.2 of [3]. As another interesting property of the affinely full-dimensional factorial 
design, we have the following conjecture (Conjecture 3.1 of ^) on the D-optimality of 
affinely full-dimensional designs. 

Conjecture 6.1. Consider the main effect model for the observations obtained in a frac- 
tional factorial design of m factors. Then D-optimal design is affinely full- dimensional 
factorial if and only if m = 5,Q,7 (mod 8 ). 

Though this conjecture is not proved in general, it is shown to be true when some 
known bounds for the maximal determinant problem ([7j, [H] and [26]) are attained. See 
[3] for details. 

In [2], we considered more realistic situation that the model is unknown. In this 
case, we cannot rely on model-based criterion such as D-optimality and have to evaluate 
the model- robustness. In [2], we considered the situation where (i) all the main effects 
are of primary interest and their estimates are required, (ii) an experimenter assumes 
that there are certain number of active two-factor interaction effects and certain number 
of active three-factor interaction effects, but it is unknown which of two- and three- 
factor interactions are active, and (iii) all the four-factor and higher-order interactions 
are negligible. This is a natural extension of the setting considered in [10]. In [3], we 
presented some optimality criteria to evaluate model-robustness of non-regular two-level 
fractional factorial designs. Our approach was based on minimizing the sum of squares 
of all the off-diagonal elements in the information matrix, and considering expectation 
under appropriate distribution functions for unknown contamination of the interaction 
effects. We also compared our criterion to a generalized minimum aberration criterion by 
[12] and affinely full dimensionality. 

7 Markov bases and conditional tests by Markov chain 
Monte Carlo method 

In this section we introduce another topic of application of Grobner basis theory to the de- 
signed experiments, called Markov bases for designed experiments. The notion of Markov 
bases was introduced in [13]. They established a procedure for sampling from discrete 
conditional distributions by constructing a connected Markov chain on a given sample 
space. Since then many works have been published on the topic of Markov bases by 
both algebraists and statisticians. This constitutes another main branch of the field of 
computational algebraic statistics. See [5] for the history of this topic. It is of interest 
to investigate statistical problems which are related to both designed experiments and 
Markov bases. In [B] and [1] we considered applying Markov bases for discrete observa- 
tions from designed experiments. In this section we review the results of these works. 

Suppose we have nonnegative integer observations for each run of a regular fractional 
design. For simplicity, we also suppose that the observations are counts of some events 
and only one observation is obtained for each run. In this case it is natural to consider 
the Poisson model, in the framework of generalized linear models ([19J). Write the ob- 
servations as y = {yi, . . . yyn)', where n is the run size and ' denotes the transpose. The 
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observations are realizations from n Poisson random variables Yi, 
tually independently distributed with the mean parameter /Zj = EiYi) 
express the mean parameter /Xj as 

log/ij = f3o + f3iXii H h f3^-iXi^-i, 



where (3 = (/3o, • • • , /Siz-i)' is the //-dimensional parameter and xa, 
V — 1 covariates. We write the covariate matrix A as 



, Yn, which are mu- 



,n. We 
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Note that the expression iQ can be treated as the null model Hq. Since the saturated 
model has n-dimensional parameter, various goodness-of-fit tests with the saturated model 
as the alternative Hi can be written as 

Ho: (/3,,...,/3„) = (0,...,0) 
Hi: (/3,,...,/3„)^(0,...,0) 

by introducing additional parameter {(3^, ■ ■ ■ , f3n)- Various other hypotheses can be written 
in a similar way. Under the null model ([9]) the sufficient statistic for the parameter P is 
given by A'y = (^"^^ yi, ^"^^ xny^ . . . , Y.7=i ^iv-iVi)' ■ 

Once we specify the null model and a test statistic, our purpose is to calculate the p 
value. In this stage, Markov chain Monte Carlo procedure is a valuable tool, especially 
when the fitting of the traditional large-sample approximation is poor and the exact cal- 
culation of the p value is infeasible. To perform the Markov chain Monte Carlo procedure, 
the key notion is a Markov basis over the conditional sample space given the values of the 
sufficient statistic 



{y I A'y = A'y°, yi is a non-negative integer, i = 1, . . . ,n}, (11) 

where y° is the observed vector. Once a Markov basis is calculated, we can construct a 
connected, aperiodic and reversible Markov chain over the space in (fTTl) . which can be 
modified so that the stationary distribution is the conditional distribution under the null 
model by the Metropolis-Hastings procedure. See p!3] and [T^ for details. 

In the arguments above, an important step is to construct a covariate matrix A in ffTOl) . 
In this step, we have to construct A so that all the parameters in ([9]) are simultaneously 
estimable. This problem corresponds to the ideal membership problem in Section [31 

We illustrate the above setup with two examples. 

We ffist consider a 2''"'^ fractional factorial design. Suppose we have observations 
y = {yi, . . . , yi^y for each run of the fractional factorial design with the defining relation 

XiX2XiX^ = X1X3X4X6 = X2X3X4XY = 1. (12) 
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The design and the observation is written as follows. 
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In this case, there are several models to be considered. If our interest is only on the main 
effects for seven factors, we may define 
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(13) 

i.e., the covariate matrix A is constructed as the design matrix and the column vector 
(1, . . . , 1)'. In this case, the parameter Pj is interpreted as the parameter contrast for the 
main effect of the factor xj for j = 1, . . . ,7. 

We can also consider models containing interaction effects. For example, if we want to 
estimate the two-factor interaction effect among xi and X2 along with all the main effects, 
we may add the column 

(1, 1, 1, 1, -1, -1, -1, -1, -1, -1, -1, -1, 1, 1, 1, 1)' (14) 

to the above A. Note that this corresponds to adding a factor in Section |5l In this case, 
P is 9-dimension and the element corresponding to the additional row f lT4|) is interpreted 
as the parameter contrast for the two factor interaction effect among xi and X2. 

If we want to estimate another interaction, we have to consider the confounding rela- 
tions. For example, two two-factor interaction effects among xi x X2 and x^ x x^ cannot 
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be estimated simultaneously since they are confounded. This confounding relation is 
shown in (fT2|) . In the terms of algebra, this confounding relation is shown as the ideal 
membership, 

X1X2 — X^X^ G (x^ — 1, . . . , X7 — 1, X1X2X4X5 — 1, XiX^XiX^ — 1, X2X^XiXi — 1), 

as discussed in Section [31 

As our second example, we consider the case of three-level factors. We also indicate 
how complex coding simplifies the specification of the conditional sample space fill I) . 
Consider the following 3^~^ fractional factorial design, where the levels are coded as 

{0,1,2}. 

Factor y 
Run xi X2 X3 

1 yi 

2 1 2 ?/2 

3 2 1 ?/3 

4 1 2 ?/4 

5 1 1 1 1/5 

6 1 2 ?/6 

7 2 1 

8 2 10^/8 

9 2 2 2 ?/9 

In this case, each main and interaction factor has more than one degree of freedom and 
has to be parameterized by more than one parameters. For example, the main effect of 
Xi can be expressed (01,02,0:3) with one constraint. One of the simplest constraints is 
03 = 0, i.e., to treat X3 as the baseline, which is expressed as the two columns 

/II 1000000 y 

VOOOlllOOOy) 

in the covariate matrix A. We can also consider a symmetric constraint Oi + 02 + 03 = 0. 
In this case, we may include the two columns 

/ 2 2 2 -1 -1 -1 y 
1^ 2 2 2 -1 -1 -1 y 

to A. Note that the conditional sample space (fTTj) is invariant for the constraints since 
the total is fixed. As for the interaction effects, see 
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Another equivalent expression is given directly from the complex coding 
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where a; = exp(27ri/3). If we allow A to be a complex matrix and consider the real and 
the complex parts, the conditional sample space ( fTTl) defined from 
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is also the same. This follows from the following basic fact of the theory of cyclotomic 
polynomials (e.g. Section 6.3 of [H]): for qi,q2,(l3 £ Q 

qi + q20J + gsw^ = ^ qi = q2 = q3- 
The same result holds for s-level factors, where s is a prime number. 



8 Some discussions 

In this paper, we review several topics related to designed experiments and computational 
algebraic statistics. As we have stated, there are two works as the beginning of computa- 
tional algebraic statistics, i.e., the work by Pistone and Wynn (|2l]) and by Diaconis and 
Sturmfels ([I3]). It is important to study whether a closer connection can be established 
between these two branches, i.e., the design ideal and the Markov basis (toric ideal). Our 
works [1] and [6] are motivated by this goal, although we do not yet have a result of some 
general nature. 

The following argument suggests that there should be some general results relating 
these two branches. Recall the covariate matrix A in (fT3l) . For the main effect model, 
except for the first element 1, each row of A is just a point in the design and therefore A 
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itself can be considered as a design. In the theory of toric ideals, the set of rows of A is 
often called a configuration defining the toric ideal. Therefore for the main effect model, 
the design simultaneously defines the design ideal and the toric ideal. Note that this 
relation also holds for three-level factors (or more generally for prime number of levels), 
if the levels are coded by complex numbers as indicated at the end of Section [71 

If we include some interaction effects in the null model for two-level case, this corre- 
sponds to adding factors as in Section [51 Therefore again there is a very simple relation 
between the design (without added factors) and the configuration for the toric ideal (with 
added factors) . This argument suggests that some algebraic properties of the design ideal 
should be reflected in algebraic properties of the toric ideal. This is an important topic 
for further research. 
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